16 research outputs found

    RecycleGPT: An Autoregressive Language Model with Recyclable Module

    Full text link
    Existing large language models have to run K times to generate a sequence of K tokens. In this paper, we present RecycleGPT, a generative language model with fast decoding speed by recycling pre-generated model states without running the whole model in multiple steps. Our approach relies on the observation that adjacent tokens in a sequence usually have strong correlations and the next token in a sequence can be reasonably guessed or inferred based on the preceding ones. Experiments and analysis demonstrate the effectiveness of our approach in lowering inference latency, achieving up to 1.4x speedup while preserving high performance.Comment: Technical Repor

    Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer

    Get PDF
    Funding: UK EPSRC grants ”Discovery” EP/P020631/1, ”ABC: Adaptive Brokerage for the Cloud” EP/R010528/1.This article presents an automatic k-means clustering solution targeting the Sunway TaihuLight supercomputer. We ïŹrst introduce a multilevel parallel partition approach that not only partitions by dataïŹ‚ow and centroid, but also by dimension, which unlocks the potential of the hierarchical parallelism in the heterogeneous many-core processor and the system architecture of the supercomputer. The parallel design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 targeting centroids, while maintaining high performance and high scalability. Furthermore, we propose an automatic hyper-parameter determination process for k-means clustering, by automatically generating and executing the clustering tasks with a set of candidate hyper-parameter, and then determining the optimal hyper-parameter using a proposed evaluation method. The proposed auto-clustering solution can not only achieve high performance and scalability for problems with massive high-dimensional data, but also support clustering without sufïŹcient prior knowledge for the number of targeted clusters, which can potentially increase the scope of k-means algorithm to new application areas.PostprintPeer reviewe

    Large-scale hierarchical k-means for heterogeneous many-core supercomputers

    Get PDF
    Funding: J.Thomson and T.Yu are supported by the EPSRC grants ”Discovery” EP/P020631/1, ”ABC: Adaptive Brokerage for the Cloud” EP/R010528/1, and EU Horizon 2020 grant Team-Play: ”Time, Energy and security Analysis for Multi/Many-core heterogenous PLAtforms” (ICT-779882, https://teamplay- h2020.eu)This paper presents a novel design and implementation of k-means clustering algorithm targeting the Sunway TaihuLight supercomputer. We introduce a multi-level parallel partition approach that not only partitions by dataflow and centroid, but also by dimension. Our multi-level (nkd) approach unlocks the potential of the hierarchical parallelism in the SW26010 heterogeneous many-core processor and the system architecture of the supercomputer. Our design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 targeting centroids, while maintaining high performance and high scalability, significantly improving the capability of k-means over previous approaches. The evaluation shows our implementation achieves performance of less than 18 seconds per iteration for a large-scale clustering case with 196,608 data dimensions and 2,000 centroids by applying 4,096 nodes (1,064,496 cores) in parallel, making k-means a more feasible solution for complex scenarios.Postprin

    Dynamic Analysis of a Large Deployable Space Truss Structure Considering Semi-Rigid Joints

    No full text
    Joints are widely used in large deployable structures but show semi-rigidity due to performance degradation and some nonlinear factors affecting the structure’s dynamic characteristics. This paper investigates the influence of semi-rigid joints on the characteristics of deployable structures in orbit. A virtual connection element of three DOFs is proposed to model the semi-rigid joints. The governing equations of semi-rigid joints are established and integrated into the dynamic equation of the structures. A series of numerical experiments are carried out to validate the proposed model’s accuracy and efficiency, and the deployable truss structures’ static and dynamic responses are analyzed. The results show that semi-rigid joints exacerbate the effects of an in-orbit microvibration on the stability of deployable truss structures. Semi-rigid joints lower the dominant frequencies of structures, leading to a ‘closely-spaced-frequencies’ phenomenon and altering the dynamic responses significantly. The effects of semi-rigid joints on deployable truss structures are long-term and can be used to establish a relationship model between structural performance and service life. Nonlinear effects vary with the external load and depend on the structures’ instantaneous status. These results indicate that semi-rigid joints significantly influence the characteristics of deployable structures, which must be considered in the design and analysis of high-precision in-orbit deployable structures

    Electrospun Zein/Polyoxyethylene Core-Sheath Ultrathin Fibers and Their Antibacterial Food Packaging Applications

    No full text
    The purpose of this work is to develop a novel ultrathin fibrous membrane with a core-sheath structure as antibacterial food packaging film. Coaxial electrospinning was exploited to create the core-sheath structure, by which the delivery regulation of the active substance was achieved. Resveratrol (RE) and silver nanoparticles (AgNPs) were loaded into electrospun zein/polyethylene oxide ultrathin fibers to ensure a synergistic antibacterial performance. Under the assessments of a scanning electron microscope and transmission electron microscope, the ultrathin fiber was demonstrated to have a fine linear morphology, smooth surface and obvious core-sheath structure. X-ray diffraction and Fourier transform infrared analyses showed that RE and AgNPs coexisted in the ultrathin fibers and had good compatibility with the polymeric matrices. The water contact angle experiments were conducted to evaluate the hydrophilicity and hygroscopicity of the fibers. In vitro dissolution tests revealed that RE was released in a sustained manner. In the antibacterial experiments against Staphylococcus aureus and Escherichia coli, the diameters of the inhibition zone of the fiber were 8.89 ± 0.09 mm and 7.26 ± 0.10 mm, respectively. Finally, cherry tomatoes were selected as the packaging object and packed with fiber films. In a practical application, the fiber films effectively reduced the bacteria and decreased the quality loss of cherry tomatoes, thereby prolonging the fresh-keeping period of cherry tomatoes to 12 days. Following the protocols reported here, many new food packaging films can be similarly developed in the future

    Clinical efficacy and safety of platelet-rich plasma in arthroscopic full-thickness rotator cuff repair: A meta-analysis.

    No full text
    BackgroundArthroscopic repair of rotator cuff tears, although commonly performed, carries the risk of retears. Therefore, bioremediation techniques such as platelet-rich plasma injections have been used as adjuvant therapies. The clinical efficacy of platelet-rich plasma in the arthroscopic repair of full-thickness rotator cuff injury is controversial. We performed a meta-analysis to evaluate the clinical effectiveness and safety of platelet-rich plasma and provide evidence-based medical recommendations for selecting the proper clinical treatment plan for full-thickness rotator cuff injuries.MethodsA search for the terms "platelet-rich plasma" and "rotator cuff" was performed in the PubMed, EMBASE, and Cochrane Library databases using a computer. After conducting quality evaluations and data extraction, RevMan 5.3 software was used to combine the effect sizes, and the GRADEpro Guideline Development Tool was used to rate the level of evidence from aspects of functional score, pain score and retear rate.ResultsEight randomized controlled trials involving 566 patients were included. The long-term retear rate(RR = 0.96, 95% CI [0.52, 1.78], P = .89), Constant score(RR = 0.96, 95% CI [0.52, 1.78], P = .89), and Visual Analog Scale score for pain (SMD = -0.28, 95% CI [-0.60, 0.04], P = .08), as well as both the long-term and short-term Disabilities of the Arm, Shoulder, and Hand scores(SMD = -0.13, 95% CI [-0.44, 0.18], P = .41;SMD = -0.02, 95% CI [-0.40, 0.36], P = .93), were not significantly different between the platelet-rich plasma and control groups. However, the short-term retear rate(RR = 0.29, 95% CI [0.13, 0.65], P = .003) and Visual Analog Scale score (SMD = -0.41, 95% CI [-0.62, -0.19], P = .0002) were significantly lower, while the short-term Constant score(SMD = 0.37, 95% CI [0.19, 0.55], P ConclusionPlatelet-rich plasma injection can effectively improve the short-term outcomes following arthroscopic repair of full-thickness rotator cuff tears, thus reducing the rate of retears, alleviating pain, and improving patients' shoulder function. Specifically, the clinical outcomes are better with the use of platelet-rich plasma in single-row fixation than in other fixation techniques. Therefore, platelet-rich plasma injection can be recommended as an adjuvant therapy in single-row repair for improved short-term results
    corecore